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Abstract 

In this paper we study non-interactive correlation distillation (NICD), a generalization of noise sen- 
sitivity previously considered in I5l l31ll39l . We extend the model to NICD on trees. In this model there 
is a fixed undirected tree with players at some of the nodes. One node is given a uniformly random 
string and this string is distributed throughout the network, with the edges of the tree acting as inde- 
pendent binary symmetric channels. The goal of the players is to agree on a shared random bit without 
communicating. 

Our new contributions include the following: 

• In the case of a fc-leaf star graph (the model considered in 1 3 1 1), we resolve the open question 
of whether the success probability must go to zero as k — > oo. We show that this is indeed the 
case and provide matching upper and lower bounds on the asymptotically optimal rate (a slowly- 
decaying polynomial). 

• In the case of the fc-vertex path graph, we show that it is always optimal for all players to use the 
same 1 -bit function. 

• In the general case we show that all players should use monotone functions. We also show, some- 
what surprisingly, that for certain trees it is better if not all players use the same function. 

Our techniques include the use of the reverse Bonami-Beckner inequality. Although the usual Bonami- 
Beckner has been frequently used before, its reverse counterpart seems very little-known; To demonstrate 
its strength, we use it to prove a new isoperimetric inequality for the discrete cube and a new result on the 
mixing of short random walks on the cube. Another tool that we need is a tight bound on the probability 
that a Markov chain stays inside certain sets; we prove a new theorem generalizing and strengthening 
previous such bounds |2|[3]|6). O n the probabilistic side, we use the "reflection principle" and the FKG 
and related inequalities in order to study the problem on general trees. 
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1 Introduction 



1.1 Non-interactive correlation — the problem and previous work 

Our main topic in this paper is the problem of non-interactive correlation distillation (NICD), previously 
considered in l51 l3~Tll3*9l . In its most general form the problem involves k players who receive noisy copies 
of a uniformly random bit string of length n. The players wish to agree on a single random bit but are not 
allowed to communicate. The problem is to understand the extent to which the players can successfully 
distil the correlations in their strings into a shared random bit. This problem is relevant for cryptographic 
information reconciliation, random beacons in cryptography and security, and coding theory; see [39]. 

In its most basic form, the problem involves only two players; the first gets a uniformly random string 
x and the second gets a copy y in which each bit of x is flipped independently with probability e. If the 
players try to agree on a shared bit by applying the same Boolean function / to their strings, they will 
fail with probability P[f(x) ^ f(y)]- This quantity is known as the noise sensitivity of f at e, and the 
study of noise sensitivity has played an important role in several areas of mathematics and computer science 
(e.g., inapproximability \26\, learning theory lH71l30l . hardness amplification ll33l . mixing of short random 
walks [27 1, percolation [10]; see also [34 1). In [5|, Alon, Maurer, and Wigderson showed that if the players 
want to use a balanced function /, no improvement over the naive strategy of letting f(x) = x\ can be 
achieved. 

The paper [31 1 generalized from the two-player problem NICD to a k -player problem, in which a uni- 
formly random string x of length n is chosen, k players receive independent e-corrupted copies, and they 
apply (possibly different) balanced Boolean functions to their strings, hoping that all output bits agree. This 
generalization is equivalent to studying high norms of the Bonami-Beckner operator applied to Boolean 
functions (i.e., ||T p /||fc); see Section|3]for definitions. The results in [31 1 include: optimal protocols involve 
all players using the same function; optimal functions are always monotone; for k = 3 the first-bit ('dicta- 
tor') is best; for fixed e and fixed n and k — > do, all players should use the majority function; and, for fixed 
n and k and e — > or e — ► 1/2 dictator is best. 

Later, Yang ll39l considered a different generalization of NICD, in which there are only two players but 
the corruption model is different from the "binary symmetric channel" noise considered previously. Yang 
showed that for certain more general noise models, it is still the case that the dictator function is optimal; he 
also showed an upper bound on the players' success rate in the erasure model. 

1.2 NICD on trees; our results 

In this paper we propose a natural generalization of the NICD models of [5j|^, extending to a tree topol- 
ogy. In our generalization we have a network in the form of a tree; k of the nodes have a 'player' located 
on them. One node broadcasts a truly random string of length n. The string follows the edges of the trees 
and eventually reaches all the nodes. Each edge of the tree independently introduces some noise, acting as a 
binary symmetric channel with some fixed crossover probability e. Upon receiving their strings, each player 
applies a balanced Boolean function, producing one output bit. As usual, the goal of the players is to agree 
on a shared random bit without any further communication; the protocol is successful if all k parties output 
the same bit. (For formal definitions, see Section |2]) Note that the problem considered in [31] is just NICD 
on the star graph of k + 1 nodes with the players at the k leaves. 

We now describe our new results: 

The /oleaf star graph: We first study the same A: -player star problem considered in I3T1 . Although 
this paper found maximizing protocols in certain asymptotic scenarios for the parameters k, n, and e, the 
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authors left open what is arguably the most interesting setting: e fixed, k growing arbitrarily large, and n 
unbounded in terms of e and k. Although it is natural to guess that the success rate of the players must go 
to zero exponentially fast in terms of k, this turns out not to be the case; [31 1 notes that if all players apply 
the majority function (with n large enough) then they succeed with probability Q,{k~ c ^) for some finite 
constant C(e) (the estimate [31 1 provides is not sharp). lOTI left as a major open problem to prove that the 
success probability goes to as k — > oo. 

In this paper we solve this problem. In Theorem 14. II we show that the success probability must indeed 
go to zero as k — > oo. Our upper bound is a slowly-decaying polynomial. Moreover, we provide a matching 
lower bound: this follows from a tight analysis of the majority protocol. The proof of our upper bound 
depends crucially on the reverse Bonami-Beckner inequality, an important tool that will be described later. 

The £>vertex path graph: In the case of NICD on the path graph, we prove in Theorem 15.11 that in 
the optimal protocol all players should use the same 1-bit function. In order to prove this, we prove in 
Theorem 15.41 a new tight bound on the probability that a Markov chain stays inside certain sets. Our 
theorem generalizes and strengthens previous work [2||3j|6l. 

Arbitrary trees: In this general case, we show in Theorem l6.3l fhat there always exists an optimal protocol 
in which all players use monotone functions. Our analysis uses methods of discrete symmetrization together 
with the FKG correlation inequality. 

In Theorem 16.21 we show that for certain trees it is better if not all players use the same function. This 
might be somewhat surprising: after all, if all players wish to obtain the same result, won't they be better 
off using the same function? The intuitive reason the answer to this is negative can be explained by Figure 
^ players on the path and players on the star each 'wish' to use a different function. Those on the star wish 
to use the majority function and those on the path wish to use a dictator function. Indeed, we will show that 
this strategy yields better success probability than any strategy in which all players use the same function. 

1.3 The reverse Bonami-Beckner inequality 

We would like to highlight the use of the reverse Bonami-Beckner inequality, mentioned above. Let us 
start by describing the original Bonami-Beckner inequality, see Theorem 13. II This inequality considers an 
operator known as the Bonami-Beckner operator (see Section^. It says that some high norm of the result of 
the Bonami-Beckner operator applied to a function can be upper-bounded by some low norm of the original 
function. Its main strength is in its ability to relate two different norms; this is the reason it is often referred 
to as a hypercontractive inequality. The inequality was originally proved by Bonami in 1970 [12] and then 
independently by Beckner in 1973 [ 8 1. It was first used to analyze discrete problems in a a remarkable paper 
by Kahn, Kalai and Linial 1271 where they considered the influence of variables on Boolean functions. 
The inequality has proved to be of great importance in the study of combinatorics of {0, l} n fl31fl6ll22l . 
percolation and random graphs EH |23l [lOl [TU and many other applications l9ll4ll3^l7ll331[T^[T9ll28ll33l. 

Far less well-known is the fact that the Bonami-Beckner inequality admits a reversed form. This reversed 
form was first proved by Borell [13] in 1982. Unlike the original inequality, the reverse inequality says that 
some low norm of the Bonami-Beckner operator applied to a non-negative function can be bounded below 
by some higher norm of the original function. Moreover, the norms involved in the reverse inequality are 
all at most 1 while the norms in the original inequality are all at least 1. A final difference between the two 
inequalities is that in the reverse inequality we need to assume that the function is non-negative. 

We are not aware of any previous uses of the reverse Bonami-Beckner inequality for the study of dis- 
crete problems. The inequality seems very promising and we hope it will prove useful in the future. To 
demonstrate its strength, we provide two applications: 
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Isoperimetric inequality on the discrete cube: As a corollary of the reverse Bonami-Beckner inequality, 
we obtain in Theorem 13 .41 an isoperimetric inequality on the discrete cube. Although it is a simple corollary, 
we believe that the isoperimetric inequality is interesting. It is also used later to give a sort of hitting time 
upper-bound for short random walks. In order to illustrate it, let us consider two subsets S, T C {— 1, l} n 
each containing a constant fraction a of the 2 n elements of the discrete cube. We now perform the following 
experiment: we choose a random element of S and flip each of its n coordinates with probability e for some 
small e. What is the probability that the resulting element is in T? Our isoperimetric inequality implies that 
it is at least some constant independent of n. For example, given any two sets with fractional size 1/3, the 
probability that flipping each coordinate with probability .3 takes a random point chosen from the first set 
into the second set is at least (1/3) 14 /' 6 « 7.7%. We also show that our bound is close to tight. Namely, 
we analyze the above probability for diametrically opposed Hamming balls and show that it is close to our 
lower bound. 

Short random walks: Our second application in Proposition l3.6l is to short random walks on the discrete 
cube. Consider the following scenario. We have two sets S,T C {—1, 1}™ of size at least a2 n each. We 
start a walk from a random element of the set S and at each time step proceed with probability 1/2 to one 
of its neighbors which we pick randomly. Let rn be the length of the random walk. What is the probability 
that the random walk terminates in T? If r = C log n for a large enough constant C then it is known that 
the random walk mixes and therefore we are guaranteed to be in T with probability roughly a. However, 
what happens if r is, say, 0.2? Notice that rn is then less than the diameter of the cube! For certain sets 
S, the random walk might have zero probability to reach certain vertices, but if a is at least, say, a constant 
then there will be some nonzero probability of ending in T. We bound from below the probability that the 
walk ends in T by a function of a and r only. For example, for r = 0.2, we obtain a bound of roughly 
a 10 . The proof crucially depends on the reverse Bonami-Beckner inequality; to the best of our knowledge, 
known techniques, such as spectral methods, cannot yield a similar bound. 

2 Preliminaries 

We now formally define the problem of "non-interactive correlation distillation (NICD) on trees with the 
binary symmetric channel (BSC)." In general we have four parameters. The first is T, an undirected tree 
giving the geometry of the problem. Later the vertices of T will become labeled by binary strings, and 
the edges of T will be thought of as independent binary symmetric channels. The second parameter of the 
problem is < p < 1 which gives the correlation of bits on opposite sides of a channel. By this we mean 
that if a bit string x 6 {—1, l} n passes through the channel producing the bit string y G { — 1, 1}" then 
= p independently for each i. We say that y is a p-correlated copy of x. We will also sometimes 
refer to e = ^ — \p £ (0, i), which is the probability with which a bit gets flipped — i.e., the crossover 
probability of the channel. The third parameter of the problem is n, the number of bits in the string at every 
vertex of T. The fourth parameter of the problem is a subset of the vertex set of T, which we denote by S. 
We refer to the S as the set of players. Frequently S is simply all of V(T), the vertices of T. 
To summarize, an instance of the NICD on trees problem is parameterized by: 

1. T, an undirected tree; 

2. /) G (0,1), the correlation parameter; 

3. n > 1, the string length; and, 

4. S C V(T), the set of players. 
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Given an instance, the following process happens. Some vertex u of T is given a uniformly random 
string x( u > G {—1, l} n - Then this string is passed through the BSC edges of T so that every vertex of T 
becomes labeled by a random string in {—1, 1}™. It is easy to see that the choice of u does not matter, in 
the sense that the resulting joint probability distribution on strings for all vertices is the same regardless of 
u. Formally speaking, we have n independent copies of a "tree-indexed Markov chain;" or a "Markov chain 
on a tree" [24]. The index set is V(T) and the probability measure P on a £ {— 1, 1} V ^ is defined by 

Y(a) = \{\ + \ P ) A ^{\-\ P ) B{a \ 

where A(a) is the number of pairs of neighbors where a agrees and B(a) is the number of pairs of neighbors 
where a disagrees. 

Once the strings are distributed on the vertices of T, the player at the vertex v £ S looks at the string 
x( v ' and applies a (pre-selected) Boolean function f v : {—1, l} n — ► {—1, 1}- The goal of the players is to 
maximize the probability that the bits f v (x^ v >) are identical for all v G S. In order to rule out the trivial 
solutions of constant functions and to model the problem of flipping a shared random coin, we insist that 
all functions f v be balanced; i.e., have equal probability of being —1 or 1. As noted in ll3D . this does 
not necessarily ensure that when all players agree on a bit it is conditionally equally likely to be —1 or 
1; however, if the functions are in addition antisymmetric, this property does hold. We call a collection 
of balanced functions (f v ) v es a protocol for the players S, and we call this protocol simple if all of the 
functions are the same. 

To conclude our notation, we write V(T, p, n, S, (f v )ves) f° r the probability that the protocol succeeds 
- i.e., that all players output the same bit. When the protocol is simple we write merely V(T, p, n, S, /). 
Our goal is to study the maximum this probability can be over all choices of protocols. We denote by 

M(T,p,n,S) = sup V(T,p,n,S,(f v ) veS ), 

(fv)ves 

and define 

M (T, p, S) = sup M (T, p,n,S). 

n 

3 Reverse Bonami-Beckner and applications 

In this section we recall the little-known reverse Bonami-Beckner inequality and obtain as a corollary an 
isoperimetric inequality on the discrete cube. These results will be useful in analyzing the NICD problem 
on the star graph and we believe they are of independent interest. We also obtain a new result about the 
mixing of relatively short random walks on the discrete cube. 

3.1 The reverse Bonami-Beckner inequality 

We begin with a discussion of the Bonami-Beckner inequality. Recall the Bonami-Beckner operator T p , a 
linear operator on the space of functions {—1, l} n — > M defined by 

(T p f)(x) = E[f(y)}, 

where y is a p-correlated copy of x. The usual Bonami-Beckner inequality, first proved by Bonami [ 12 1 and 
later independently by Beckner [ 8 1, is the following: 

Theorem 3.1 Let f : {-1, l} n -» R and q > p > 1. Then 

\\T p f\\ q < \\f\\ p forallQ^p^ip-l) 1 ' 2 /^-!) 1 ' 2 . 
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The reverse Bonami-Beckner inequality is the following: 
Theorem 3.2 Let f : {— 1, l} n — > M— /?e a nonnegative function and let — oo < g < p < 1. 77ze« 

||r„/||, > /or a// < p < (1 - ^/(l _ 5)1/2. (1) 

Note that in this theorem we consider r-norms for r < 1. The case of r = is a removable singularity: 
by || / 1| we mean the geometric mean of /. Note also that since T p is a convolution operator, it is positivity- 
improving for any p < 1; i.e., when / is nonnegative so too is T p f, and if / is further not identically zero, 
then Tpf is everywhere positive. 

The reverse Bonami-Beckner theorem is proved in the same way the usual Bonami-Beckner theorem is 
proved; namely, one proves the inequality in the case of n = 1 by elementary means, and then observes that 
the inequality tensors. Since Borell's original proof may be too compact to be read by some, we provide an 
expanded version of it in Appendix lAl for completeness. 

We will actually need the following "two-function" version of the reverse Bonami-Beckner inequality 
which follows easily from the reverse Bonami-Beckner inequality using the (reverse) Holder inequality (see 
Appendix lAt: 

Corollary 3.3 Letf,g: { — 1,1}™ — ► R— be nonnegative, letx G { — l,l} n be chosen uniformly at random, 
and let y be a p-correlated copy of x. Then for — 00 < p,q < 1, 

B[f(x)g(y)]>\\f\\ p \\g\\ q for all < p < (1 - p) l / 2 {l - q) 1 ' 2 . (2) 
3.2 A new isoperimetric inequality on the discrete cube 

In this subsection we use the reverse Bonami-Beckner inequality to prove an isoperimetric inequality on the 
discrete cube. Let S and T be two subsets of {—1, l} n . Suppose that x G {—1, l} n is chosen uniformly at 
random and y is a p-correlated copy of x. We obtain the following theorem, which gives a lower bound on 
the probability that x G S and y G T as a function of |5|/2 n and |T|/2 n only. 

Theorem 3.4 Let S,T C {-1, l} n with \S\ = exp(-s 2 /2)2 n and \T\ = exp(-i 2 /2)2 n . Let x be chosen 
uniformly at random from { — 1, 1}™ and let y be a p-correlated copy of x. Then 

P[x G S,y£T}> exp I — — — % I . (3) 

Proof: Take / and g to be the 0-1 characteristic functions of S and T, respectively. Then by Corollary 13.31 
for any choice of p, q < 1 with (1 — p)(l — q) = p 2 , we get 

P[x eS,yeT}= E[f(x)g(y)} > \\f\\ p \\g\\ q = ex V (-s 2 /2p) ex P (-t 2 /2g). (4) 

Write p = 1 — pr, q = 1 — p/rin ©, with r > 0. Maximizing the right-hand side as a function of r the 
best choice is r = ((t/s) + p)/(l + p(t/s)) which yields in turn 

(Note that this depends only on the ratio of t and s.) Substituting this choice of r (and hence p and q) into (@J 
yields exp (- \ g!±2gg|tf ) , as claimed. ■ 
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Taking a = exp(— s 2 /2) and a a = exp(— 1 2 /2) we obtain 
1 



-(s 2 + 2pst + t 2 ) = log a — p^/ — 2 log ay/— 2a log a + a log a = log <r(l + 2p\[a + a) 



and therefore 

1 s 2 + 2/>st + i 2 



Pxn . : | - , T (l+2p v ^+a)/(l-p 2 ) 

Pl 2 1-p 2 ' 
Thus, conditioned on starting at S, the probability of ending at T is at least 

(J (l+2p v ^+ a )/(l-p 2 )-l = a {V^+P?/Ci~p' 2 ). 

We thus obtain the following corollary of Theorem 13 .41 

Corollary 3.5 Let S C { — 1,1}" have fractional size a £ [0,1], and let T C { — l,l} n have fractional 
size a a , for a > 0. If x is chosen uniformly at random from S and y is a p-correlated copy of x, then the 
probability that y is in T is at least 

a' 



r (^+p) 2 /(l-P 2 ) 



In particular, if\S\ = \T\ then this probability is at least cr^ 1+p ^^ 1 p \ 

In Subsection !3.4l below we show that the isoperimetric inequality is almost tight. First, we prove a 
similar bound for random walks on the cube. 

3.3 Short random walks on the discrete cube 

We can also prove a result of a similar flavor about short random walks on the discrete cube: 

Proposition 3.6 Let r > be arbitrary and let S and T be two subsets of {—1, l} n . Let a £ [0, 1] be 
the fractional size of S and let a be such that the fractional size of T is a a . Consider a standard random 
walk on the discrete cube that starts from a uniformly random vertex in S and walks for rn steps. Here by 
a standard random walk we mean that at each time step we do nothing with probability 1 /2 and we walk 
along the ith edge with probability l/2n. Let p( T ") (S,T) denote the probability that the walk ends in T. 
Then, 

, v (V^+exp(-T)) 2 /cr (-l+a)/2. 
P (Tn) (S,T) > a l-cxp(-2r) _o( J. 

V rn / 

/ \ l+cxp(-r) 

In particular, when \S\ = \T\ = a2 n then p( Tn \S,T) > ct 1 — p(— 5 - 0(±). 

The Laurent series of is 2/r + r/6 - 0(r 3 ) so for 1/ log n < r < 1 our bound is roughly a < T . 

For the proof we will first need a simple lemma: 

Lemma 3.7 For y > and any < x < y, 

0<e- x -(l-x/y)y <0(l/y). 

Proof: The expression above can be written as 

e ~ x — e v iog(i-x/y) _ 

We have log(l — x/y) < —x/y and hence we obtain the first inequality. For the second inequality, notice 
that if x > O.ly then both expressions are of the form e~ n( - yS> which is certainly 0(l/y). On the other hand, 
if < x < O.ly then there is a constant c such that 

log(l - x/y) > -x/y - cx 2 /y 2 . 
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The Mean Value Theorem implies that for < a < b, e a — e b < e a (b — a). Hence, 

2 —x 

e -x _ e ylog(l-x/ W ) < e -*(_ y l og (l _ x /y) _ x ) < CX 6 , 

y 

The lemma now follows because x 2 e~ x is uniformly bounded for x > 0. ■ 

We now prove Proposition 13.61 The proof uses Fourier analysis; for the required definitions see, e.g., 
ED. 

Proof: Let x be a uniformly random point in {— 1, l} n and y a point generated by taking a random walk 
of length rn starting from x. Let / and g be the 0-1 indicator functions of S and T, respectively, and say 
E[f] = a, E[g] = a a . Then by writing / and g in their Fourier decomposition we obtain that 

a.p(™)(S,T) = P[x G S,y G T] = E[/(x) 5 (z/)] = ^ /(^)g(F)E[x^ W ] 



n 



where [7 and V range over all subsets of {1, . . . , n}. Note that E[x(/yv] is zero unless U = V. Therefore 

u u 
= W)g(U) exp(-r\U\) + £ /(I7)$(tf) [(l " ^)™ - exp(-r|C/|)" 

= (f, T cxp ^ T) g) + £ /(E7)$(E0 [(l " ^) T " " exp(-r|C/|y 



> (/,T cxp( _ r) <7) - max | (l - M)™ - ex P (-r|C7|)| £ |/(^(C/)|. 



u 



By Corollary H31 



^(flT^-^g) >a l-cxp(-2r, 

By Cauchy-Schwarz and Parseval's identity, 



X] 1/(^)^(^)1 < II/II2II5II2 = H/H2NI2 = 
17 

In addition, from Lemma |3~71 with x = r\U\ and y = rn we have that 

O 



max 



-cxpi-riri) 



V 



rn 



Hence, 

/ \ (VH+exp(- T )) 2 /£r (-l+a)/2 

p (Tn) (5, T) > a i—P(-2 T ) _ o 



rn 
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3.4 Tightness of the isoperimetric inequality 

We now show that Theorem l3.4l is almost tight. Suppose x G {—1, l} n is chosen uniformly at random and 
y is a p-correlated copy of x. Let us begin by understanding more about how x and y are distributed. Define 



i p 
p i 



and recall that the density function of the bivariate normal distribution ^s(p) : E 2 
covariance matrix S(p), is given by 



x(o)(x,y) = (2vr)- 1 (l-p 2 ) 2 e xp 



1 x 2 — 2pxy + y 2 



(l-p 2 )-^^) 



2 

y - px 



1-p 2 



i 



(l-p 2 )2 

Here <p denotes the standard normal density function on R, </>(x) = (27r) _1 / 2 e _:c2//2 . 



)>0 



with mean and 



Proposition 3.8 Let x G { — 1, l} n &e chosen uniformly at random, and let y be a p-correlated copy of x. 



LetX 



n 



-1/2 



J27=i x i an d y 



n 



-1/2 



Y^i=i Hi- Then as n — > oo, /?a/r of random variables (X, Y) 



approaches the distribution 0s(p)- ^ s an error bound, we have that for any convex region R 



C 



P[(X,Y) sR]- JJ <h W (x,y)dydx 



<o((i 



l/2 n -l/2 



)• 



Proof: This follows from the Central Limit Theorem (see, e.g., [20 1), noting that for each coordinate i, 
E[x 2 ] = E[y 2 ] = 1, E[xjj/j] = p. The Berry-Esseen-type error bound is proved in Sazonov El P- 10, Item 
6]. ■ 

Using this proposition we can obtain the following result for two diametrically opposed Hamming balls. 

Proposition 3.9 Fix s,t > 0, and let S,T C { — 1,1}™ be diametrically opposed Hamming balls, with 
S = {x: Yli x i — —sn 1 / 2 } and T = {x: Yli x i — tn 1 ^ 2 }. Let x be chosen uniformly at random from 
{ — 1, l} n and let y be a p-correlated copy of x. Then we have 



lim P[x G S,y G T] < 



p" 



2irs(ps + t) 



cxp 



Is 2 + 2pst + t 2 
2 1^7 
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Proof: 



f'OO poo 

lim P[x £ S,y £ T] = <Pz(- P )(x,y) dy dx ( By LemmaETgl) 

rwo ° is Jt 



< 



oo roo 



[ X -^^^-^ V)dydX 

xipx + y) 
Since y — ^ > 1 on x > s, y > t 
s(ps + 1) 

1 cr^f^n. 



oo roo ^ 



- 7^7 X L t ^TT) HxH[ ^=^ ]dzdx 

Using z = px + y and noting > 1 on i > s, z > /)s + 1 

s(ps + i) 




1 



+ - /0 2 Vis / \Jps+t \\JT- p 2 ^ 



xcj){x)dx / 20 — ^=^= dz 



\J\- p 2 ( 1 s 2 + 2pst + t 



2ns(ps + t) 6XP V 2 l-/> 2 
The result follows. ■ 

By the Central Limit Theorem, the set S in the above statement satisfies (see [ 1 26.2.12]), 

lim |S|2" n = -= / e~ x2/2 dx ~ exp(-s 2 /2)/(V2^s). 

n ^°° V27T is 

For large s (i.e., small \S\) this is dominated by exp(— s 2 /2). A similar statement holds for T. This shows 
that Theorem l3.4l is nearly tight. 



4 The best asymptotic success rate in the /c-star 

In this section we consider the NICD problem on the star. Let Star^ denote the star graph on k + 1 vertices 
and let S k denote its k leaf vertices. We shall study the same problem considered in 071; i.e., determining 
.M(Starfc, p, Sk)- Note that it was shown in that paper that the best protocol in this case is always simple 
(i.e., all players should use the same function). 

The following theorem determines rather accurately the asymptotics of A4(Starfc, p, S^)'- 
Theorem 4.1 Fix p £ (0, 1] and let u = u{p) = \ — 1. Then for k — > oo, 

M(st^ k , P ,s k ) = e{k- / ) , 

where G(-) denotes asymptotics to within a subpolynomial (k ^ ) factor. The lower bound is achieved 
asymptotically by the majority function MAJ n with n sufficiently large. 
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Note that if the corruption probability is very small (i.e., p is close to 1), we obtain that the success rate 
only drops off as a very mild function of k. 

Proof of upper bound: We know that all optimal protocols are simple, so assume all players use the same 
balanced function / : {-l,l} n -> {-1,1}- Let F_i = /"H" 1 ) and F i = /"H 1 ) be the sets wh ere 
/ obtains the values —1 and 1 respectively. The center of the star gets a uniformly random string x, and 
then independent p-correlated copies are given to the k leaf players. Let y denote a typical such copy. The 
probability that all players output —1 is thus E x [P[f(y) = — l|x] fc ]. We will show that this probability is 
0{k~ v ). This complete the proof since we can replace / by — / and get the same bound for the probability 
that all players output 1. 

Suppose E x \P[f(y) = -l\x] k ] > 25 for some 5; we will show 5 must be small. Define 

S = {x: P[f(y) = -1 | x] k > 5}. 
By Markov's inequality we must have |5| > 52 n . Now on one hand, by the definition of S, 

P[y G Fi | x G 5] < 1 - 5 l ' k . (5) 

On the other hand, applying Corollary 13.51 with T = Ft an d a < l/log 2 (l/5) < 1/ log (1/(5) (since 
|Fi| = \2 n ), we get 

P[y G F x | x G S] > ^(iog- 1/2 (i/<5)+P) 2 /(^P 2 ). (6) 

Combining © and © yields the desired upper bound on 6 in terms of k, 5 < A;~ z/ +°( 1 ) by the following 
calculations. We have 

1 _ $1/* > < 5(log- 1/2 (V<5)+p) 2 /(l-P 2 ). 
We want to show that the above inequality cannot hold if 

(gcVlog k \ V 
—) ■ (7) 

where c = c(p) is some constant. We will show that if 5 satisfies © and c is sufficiently large then for all 
large k 

gl/k + s Qag-V a O./S)+pf/(l-f) > L 

Note first that 
On the other hand, 

(5 (log- 1 /2(i/5) +p) 2/(i_p2) = r log-i5/(l-p 2 ) . § 2p\og-l/ 2 (l/5)/(l-(?) . £P 2 /(1-P 2 ). (9) 

Note that 

^P 2 /(1-P 2 ) = gl/v > 

and 

^lar^/M) = exp (-^TbgTW) > exp (-7^2 

Finally, 

^log- 1( 5/(l-P 2 ) =exp 



1-p 2 

Thus if c = c(p) is sufficiently large then the left hand side of © is at least - ^ g k . This implies the desired 
contradiction by Q and ©. ■ 
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Proof of lower bound: We will analyze the protocol where all players use M AJ n , similarly to the analysis 
of CIV Our analysis here is more careful resulting in a tighter bound. 

We begin by showing that the probability with which all players agree if they use MAJ n , in the case of 
fixed k and n —* oo, is: 

n lim P(Star fc ,p,n,5 fc ,MAJ n ) = 2i/ 1 / 2 (27r) (i '- 1)/2 f t k I{tf- l dt, (10) 

n~odd J° 

where I = <f> o is the so-called Gaussian isoperimetric function, with (j>{x) = (2it)~ 1 / 2 exp(— x 2 /2) 
and = 4>{t)dt the density and distribution functions of a standard normal random variable respec- 
tively. 

Apply Proposition 13. 81 with X ~ N(0, 1) representing n -1 / 2 times the sum of the bits in the string at 
the star's center, and Y\X ~ N(pX, 1 — p 2 ) representing n -1 / 2 times the sum of the bits in a typical leaf 
player's string. Thus as n — > oo, the probability that all players output +1 when using M AJ n is precisely 



/ \ k 

oo / „ _ \ roo 



<]> j P X \ (j)( x ) dx= I $ ( v 1 l 2 x S j (f>(x) dx. 



oo V — I J —oo 

Since M AJ n is antisymmetric, the probability that all players agree on +1 is the same as the probability they 
all agree on —1. Making the change of variables t = <&(v~ l l 2 x), x = v l / 2 <&~ l (t), dx = v l l 2 I{t)~ l dt, we 
get 

lim P(Star fcl/0 ,n,5 fc ,MAJ n ) = Iv 1 ' 2 / 9{ { >' dt 



,2 



n^oo 
n odd 



2v 1 / 2 {2k)^ 1 V 2 / t k i{ty- l dt 



as claimed. 

We now estimate the integral in ill Ob . It is known (see, e.g., [11|) that I(s) > J(s(l — s)), where 
J(s) = syrn(l/s). We will forego the marginal improvements given by taking the logarithmic term and 
simply use the estimate I(t) > t(l — t). We then get 



f t k i(ty- l dt > f 1 

Jo Jo 



t k {t{l-t)) v - x dt 
- W^T <E6 ' 2 ''' 622]) 

> T(v){k + 2v) v (Stirling approximation). 

Substituting this estimate into dTOl) we get linin^oo 7 ? (Starfc, p, n, S^, MAJ n ) > c{v)k~ u where c{v) > 
depends only on p, as desired. ■ 

We remark that in the upper bound above we have in effect proved the following theorem regarding high 
norms of the Bonami-Beckner operator applied to Boolean functions: 

Theorem 4.2 Let f: {— l,l} n — ► {0,1} and suppose E[f] < 1/2. Then for any fixed p £ (0,1], as 
k -> oo, \\T p f\\ k k < k- u+ °^\ where v=X 1 -\. 
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Since we are trying to bound a high norm of T p f knowing the norms of /, it would seem as though the usual 
Bonami-Beckner inequality would be effective. However this seems not to be the case: a straightforward 
application yields 



only a constant upper bound. 

5 The optimal protocol on the path 

In this section we prove the following theorem which gives a complete solution to the NICD problem on a 
path. In this case, simple dictator protocols are the unique optimal protocols, and any other simple protocol 
is exponentially worse as a function of the number of players. 

Theorem 5.1 • Let Path/% = {vq, v\, . . . , vt} be the path graph of length k, and let S be any subset 
of Pathfc of size at least two. Then simple dictator protocols are the unique optimal protocols for 
"P(Pathfc, p, n, S, (f v )). In particular, if S = {vi , . . . , Vi e } where io < i% < ■ ■ ■ < it, then we have 



• Moreover, for every p and n there exists c = c(p, n) < 1 such that if S = Path^ then for any simple 
protocol f which is not a dictator, 



where T> denotes the dictator function. 
5.1 A bound on inhomogeneous Markov chains 

A crucial component of the proof of Theorem 15. H is a bound on the probability that a Markov chain stays 
inside certain sets. In this subsection, we derive such a bound in a fairly general setting. Moreover, we 
exactly characterize the cases in which the bound is tight. This is a generalization of Theorem 9.2.7 in @ 
and of results in |2j|3l. 

Let us first recall some basic facts concerning reversible Markov chains. Consider an irreducible Markov 
chain on a finite set S. We denote by M = (m(x, y)) x y( - s the matrix of transition probabilities of this chain, 
where m(x,y) is a probability to move in one step from x to y. We will always assume that M is ergodic 
(i.e., irreducible and aperiodic). 

The rule of the chain can be expressed by the simple equation p,\ = poM, where po is a starting 
distribution on S and p,\ is a distribution obtained after one step of the Markov chain (we think of both as row 
vectors). By definition, tti(x, y) = 1. Therefore, the largest eigenvalue of M is 1 and a corresponding 
right eigenvector has all its coordinates equal to 1. Since M is ergodic it has a unique (left/right) eigenvector 
corresponding to an eigenvalue with absolute value 1. We denote the unique right eigenvector (1, . . . , 1)* 
by 1. We denote by it the unique left eigenvector corresponding to the eigenvalue 1 whose coordinate sum 
is 1. 7r is the stationary distribution of the Markov chain. Since we are dealing with a Markov chain whose 
distribution it is not necessarily uniform it will be convenient to work in L 2 (S, 7r). In other words, for any 
two functions / and g on S we define the inner product (/, g) = ^ x&s ^{x)f(x)g{x). The norm of / 



T P f\\ k < ||/IU*-i )+ i=E[/]VC^-i)+i) 
T p f\\i < (i/2) fc /(P 2 ( fe - 1 )+ 1 ) « (1/2) 1 /" 2 , 




P(Path fc , p, n, S, f) < P(Path 



k ,p,n,S,V) C \ s \- 1 



equals ||/|| 2 = V^T) = 



tt{x)P(x). 
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Definition 5.2 A transition matrix M = (rri{x, y)) x y( - s for a Markov chain is reversible with respect to a 
probability distribution ir on S ifir(x)m(x, y) = ir(y)m(y, x) holds for all x, y in S. 

It is known that if M is reversible with respect to it then ir is the stationary distribution of M. Moreover, 
the corresponding operator taking L 2 (S,ir) to itself defined by Mf(x) = ^2 y Tn(x,y)f(y) is self-adjoint, 
i.e., (M f, g) = (/, M g) for all /, g. Thus, it follows that M has a set of complete orthonormal (with respect 
to the inner product defined above) eigenvectors with real eigenvalues. 

Definition 5.3 If M is reversible with respect to tt and Ai < . . . < A r _i < A r = 1 are the eigenvalues of 
M, then the spectral gap of M is defined to be 5 = min { | — 1 — Ai | , 1 1 — A r _i| } . 

For transition matrices M\ , M2 , . . . on the same space S, we can consider the time-inhomogeneous 
Markov chain which at time starts in some state (perhaps randomly) and then jumps using the matrices 
Mi, M2, ... in this order. In this way, Mj will govern the jump from time i — 1 to time i. We write I a 
for the (0-1) indicator function of the set A and tta for the function defined by tta{x) = Ia{x)tv(x) for 
all x. Similarly, we define tt(A) = YlxtA 71 ^)- The following theorem provides a tight estimate on the 
probability that the inhomogeneous Markov chain stays inside certain sets at every step. 

Theorem 5.4 Let M\ , M2 , • • • , be ergodic transition matrices on the state space S, all of which are 
reversible with respect to the same probability measure it. Let 5i > be the spectral gap of matrix Mi and 
let Aq, Ax, . . . ,Ak be subsets of S. 

• If {Xi}i =0 denotes the time-inhomogeneous Markov chain using the matrices Mx, M2, • • • , and 
starting according to distribution tt, then 



K 

»[Xi eAi Vi = 0...k}< 0rO4o)V^fc)Il i 1 - 5 i i 1 ~ 



i=l 



(11) 



• Suppose we further assume that for all i, Si < 1 and that X\ > — 1 + Si (X\ is the smallest eigenvalue 
for the ith chain). Then equality in Hit holds if and only if the sets Ai are the same set A and for all 
i the function I a — ir(A)\ is an eigenfunction of Mi corresponding to the eigenvalue 1 — <5j. 

• Finally, suppose even further that all the chains Mi are identical and that there is some set A' such 
that equality holds as above. Then there exists a constant c = c{M) < 1 such that for all sets Afar 
which a strict inequality holds, we have the stronger inequality 

k 

P[Xi eAi Wi = 0...k] < c k Tr{A) JJ [1 - S(l - n(A))) . 

i=l 

Remark: Notice that if all the sets Ai have 7r-measure at most a < 1 and all the M,'s have spectral gap at 
least S, then the upper bound is bounded above by 

a[a + (l-S)(l-a)] k . 

Hence, the above theorem generalizes the Theorem 9.2.7 in 1 6 1 and strengthens the estimate from [3]. 
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5.2 Proof of Theorem |5J] 



If we look at the NICD process restricted to positions Xi ,Xi 1 , . . . , X{ t , we obtain a time-inhomogeneous 
Markov chain {Xj}j =0 where Xq is uniform on { — 1, l} n and the I transition operators are powers of the 

Bonami-Beckner operator, T^~ io , Tj?~ il , ■ ■ ■ ,Tp l %l ~ x . Equivalently, these operators are T '% t -i , 1 Ua-n, 
. . . , T p i e -i e _ 1 . It is easy to see that the eigenvalues of T p are 1 > p > p 2 > • • • > p n and therefore its 
spectral gap is 1 — p. Now a protocol for the I + 1 players consists simply of I + 1 subsets Aq, . . . , Ag of 
{— 1, 1}" , where Aj is a set of strings in {—1, 1}™ on which the jth player outputs the bit 1. Thus, each Aj 
has size 2 n , and the success probability of this protocol is simply 

P[Xi eAi Vt = 0. . .£] + P[X, G Aj Vt = . . . £}. 

But by Theorem 15 .41 each summand is bounded by 




yielding our desired upper bound. It is easy to check that this is precisely the success probability of a simple 
dictator protocol. 

To complete the proof of the first part it remains to show that every other protocol does strictly worse. 
By the second statement of Theorem 15.41 (and the fact that the simple dictator protocol achieves the upper 
bound in Theorem l5.4t . we can first conclude that any optimal protocol is a simple protocol, i.e., all the sets 
Aj are identical. Let A be the set corresponding to any potentially optimal simple protocol. By Theorem l5.4l 
again the function I a — (\A\2~ n )l = I a — ^1 must be an eigenf unction of T p r for some r corresponding 
to its second largest eigenvalue p r . This implies that / = 21 a — 1 must be a balanced linear function, 
f( x ) = S|5|=i f{S) x s- It is well known (see, e.g., I321 ) that the only such Boolean functions are dictators. 
This completes the proof of the first part. The second part of the theorem follows immediately from the third 
part of Theorem l5.4l ■ 

5.3 Inhomogeneous Markov chains 

In order to prove Theorem 15 .41 we need a lemma that provides a bound on one step of the Markov chain. 

Lemma 5.5 Let M be an ergodic transition matrix for a Markov chain on the set S which is reversible with 
respect to the probability measure ir and which has spectral gap 5 > 0. Let A% and A2 be two subsets of 
S and let P\ and P2 be the corresponding projection operators on L 2 (S, ir) (i.e., Pj/(x) = f(x)lAi(x) for 
every function f on S). Then 

HiW.Pi || < 1 - S (l - \Ar(^i)\A(42)) , 

where the norm on the left is the operator norm for operators from L 2 (S, it) into itself. 

Further, suppose we assume that 5 < 1 and that Ai > —1 + 5. Then equality holds above if and only if 
A\ = A2 and the function Ia 1 — n(Ai)l is an eigenfunction of M corresponding to 1 — 5. 

Proof: Let e±, . . . , e r _i, e r = 1 be an orthonormal basis of right eigenvectors of M with corresponding 
eigenvalues Ai < . . . < A r _i < A r = 1. For a function / on S, denote by supp(/) = {x G S \ f(x) 7^ 0}. 
It is easy to see that 

llfbMPiH = sup {|(/i, Mf 2 ) I : ||/i|| 2 = 1, ||/ 2 || 2 = l,supp(/i) C Ai,supp(/ 3 ) C A 2 }. 
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Given such /i and / 2 , expand them as 



/i = E Uieh fo = ^2 Viei 

i=l i=l 

and observe that for j = 1,2, 

|</,-,l)| = \(f„I Aj )\ < ll/jlbll^lla = (12) 
But now by the orthonormality of the e»'s we have 



r 

\UiVi 



i=l 



< ^ |Ai«j«<| < l)(/ 2 , 1)1 + (1 - 5) ^2 M W 



i=l i<r— 1 



< v / ^)V / ^) + (l-«5)fl- =l-s(l-y^(Ai)^{A 2 



\(h,Mf 2 )\ 



Here we used that ]T\ < 1 which follows from f\ and f 2 having norm 1. 

As for the second part of the lemma, if equality holds then all the derived inequalities must be equalities. 
In particular, from the inequality in (fl2l it follows that for j = 1, 2, fj = ±(l/ y/ir(Aj))lAj. Since 5 < 1 
is assumed it follows from the last inequality in (fT3b that we must also have that ^ \uiVi\ = 1 from which 
we can conclude that \u{\ = \v{\ for all i. Since — 1 + 5 is not an eigenvalue, for the last equality in (fT3l to 
hold we must have that the only nonzero Uj's (or Wj's) correspond to the eigenvalues 1 and 1 — 5. Next, for 
the first inequality in (fT3l to hold, we must have that u = (u%, . . . , u n ) = ±t? = . . . , t> n ) since Aj can 
only be 1 or 1 — 5 and \m\ = \vi\. This gives that fi = ±f 2 and therefore A\ = A 2 . 

Finally, we also get that /i — (/i, 1)1 is an eigenf unction of M corresponding to the eigenvalue 1 — 5. 
To conclude the proof, note that if A\ = A 2 and Ia 1 — 7r(^i)l is an eigenfunction of M corresponding to 
1 — 5, then it is easy to see that when we take fi = f 2 = Iaj — k(A\)\, all inequalities in our proof become 
equalities. ■ 

Proof of Theorem l5.4l Let Pi denote the projection onto Ai, as in Lemma l5~5l It is easy to see that 

P[X 4 E Ai Vi = . . . k] = n Ao P M 1 P 1 M 2 ■ ■ ■ P k ^M k P k I Ak . 

Rewriting in terms of the inner product, this is equal to 

(I Ao , (P M 1 P 1 M 2 • • • P k ^M k P k )I Ak ). 

By Cauchy-Schwarz it is at most 

||I^ || 2 ||/4 fe ||2||PoMiPiM 2 • ■ ■ P k ^M k P k \\, 

where the third factor is the norm of P§M\P\M 2 ■ ■ ■ P k _\M k P k as an operator from L 2 (S,tt) to itself. 
Since Pf = Pi (being a projection), this in turn is equal to 



^(AoWTv(A k )\\(P M l P 1 )(P 1 M 2 P 2 ) ■ ■ ■ (P k ^M k P k )\\. 
By Lemma l531 we have that for all i = 1, . . . , k 



Pi-lM e Pi\\ < 1 - Si 1 - yJ^Ai^y/^Ai)) . 
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Hence 

k k 

I n^MP;)! < n [ i - 6i i 1 - v^i-i)v<Ai 

i=l i=l 

and the first part of the theorem is complete. 

For the second statement note that if we have equality, then we must also have equality for each of the 
norms ||Pj_iMjPj||. This implies by Lemma 1531 that all the sets A\ are the same and that Ij^ — 7r(^4j)l is 
in the 1 — 5i eigenspace of M\ for all i. For the converse, suppose on the other hand that A\ = A for all i 
and 1a — ir(A)l is in the 1 — 5i eigenspace of Mj. Note that 

Pi^MiPtlA = P-iMJa = P^Mi^A)! + (I A ~ vr(A)l)) = P < _ 1 (tt(A)1 + (1 - 8^1 A - vr(A)l)) 
= tt(A)I a + (1 - Si)I A - (1 - Si)n(A)I A = (1 - Si(l - tt(A))I a . 

Since P? = P», we can use induction to show that 

k k 

Tr Ao P MxPxM 2 ■ ■ ■ P k „xM k P k I Ah = 7T A HiP^xMiPi) I a = n(A) II (1 - *<(1 - < A )) . 

1=1 8=1 

completing the proof of the second statement. 

In order to prove the third statement, it suffices to note that if A does not achieve equality and P is the 
corresponding projection, then ||PMP|| < 1 — 5(1 — it (A)). ■ 



6 NICD on general trees 

In this section we give some results for the NICD problem on general trees. First we observe that the 
following statement follows easily from the proof of Theorem 1.3 in [31 1: 

Theorem 6.1 For any NICD instance (T, p, n, S) in which \S\ = 2 or 1 5*| =3 the simple dictator protocols 
constitute all optimal protocols. 



6.1 Example with no simple optimal protocols 

It appears that the problem of NICD in general is quite difficult. In particular, using Theorem l5.1l we show 
that there are instances for which there is no simple optimal protocol. Note the contrast with the case of 
stars where it is proven in [31 1 that there is always a simple optimal protocol. 

Theorem 6.2 There exists an instance (T, p,n,S) for which there is no simple optimal protocol. In fact, 
given any p and any n > 4, there are integers k\ and k% such that ifT is a k\-leaf star together with a path 
of length &2 coming out of the center of the star ( see Figure and S is the full vertex set of T, then this 
instance has no simple optimal protocol. 

Proof: Fix p and n > 4. Recall that we write e = \ — \p and let Bin(3, e) be a binomially distributed 
random variable with parameters 3 and e. As was observed in [31 ], 

P(Star fe ,p,n,5 fc ,MAJ 3 ) > ^P[Bin(3,e) < l] k . 

o 

To see this, note that with probability 1/8 the center of the star gets the string (1,1,1). Since 
P[Bin(3, e) < 1] = (1 — e) 2 (l + 2e) > 1 - e for all e < 1/2, we can pick kx large enough so that 

P(Star fel , p, n, S kl , MAJ 3 ) > 8(1 - e) fcl . 
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Next, by the last statement in Theorem 15.41 there exists C2 = C2{p,n) > 1 such that for all balanced 
non-dictator functions / on n bits 

P(Path fe ,p,n,Path fc ,£>) > ^(Pathfc, p, n, Path fc , /)c£. 

Choose k2 large enough so that 

(l-e) fcl 4 2 > 1. 




Figure 1 : The graph T with fci = 5 and &2 = 3 

Now let T be the graph consisting of a star with k\ leaves and a path of length k,2 coming out of its 
center (see Figure 0, and let S = V(T). We claim that the NICD instance (T,p,n,S) has no simple 
optimal protocol. We first observe that if it did, this protocol would have to be T>, i.e., V(T, p, n, S, f) < 
V(T, p, n, S, V) for all simple protocols / which are not equivalent to dictator. This is because the quantity 
on the right is (1 — e) kl+k ' 2 and the quantity on the left is at most "P(Pathfc 2 , p, n, Pathfc 2 , /) which in turn 
by definition of C2 is at most (1 — e) fc2 /c^ 2 . This is strictly less than (1 — e) fc i+ fc 2 by the choice of ki- 

To complete the proof it remains to show that V is not an optimal protocol. Consider the protocol where 
&2 vertices on the path (including the star's center) use the dictator T> on the first bit and the k\ leaves of 
the star use the protocol MAJ3 on the last three out of n bits. Since n > 4, these vertices use completely 
independent bits from those that vertices on the path are using. We will show that this protocol, which we 
call /, does better than V. 

Let A be the event that all vertices on the path have their first bit being 1 . Let B be the event that each 
of the k\ leaf vertices of the star have 1 as the majority of their last 3 bits. Note that P(A) = |(1 — e) k ' 2 and 
that, by definition of k\, P{B) > 4(1 — e) kl . Now the protocol / succeeds if both A and B occur. Since A 
and B are independent (as distinct bits are used), / succeeds with probability at least 2(1 — e) k2 (l — e) kl 
which is twice the probability that the dictator protocol succeeds. ■ 

Remark: It was not necessary to use the last 3 bits for the k\ vertices; we could have used the first 3 (and 
had n = 3). Then A and B would not be independent but it is easy to show (using the FKG inequality) that 
A and B would then be positively correlated which is all that is needed. 

6.2 Optimal monotone protocols 

Next, we present some general statements about what optimal protocols must look like. Using discrete 
symmetrization together with the FKG inequality we prove the following theorem, which extends one of the 
results in [31 1 from the case of the star to the case of general trees. 

Theorem 6.3 For all NICD instances on trees, there is an optimal protocol in which all players use a 
monotone function. 

One of the tools that we need to prove Theorem 16. 31 is the correlation inequality obtained by Fortuin et 
al. I2T1 which is usually called the FKG inequality. We first recall some basic definitions. 
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Let D be a finite linearly ordered set. Given two strings x, y in D m we write x < y iff X{ < y, for all 
indices 1 < i < to. We denote byxVy and x A y two strings whose ith coordinates are max(xj, yi) and 
min(xj, y^ respectively. A probability measure /i : D m — > R-° is called log-supermodular if 

Kv)K$) <Kv^S) f j,(r ) A5) (14) 

for all r/, <5 G D m . If satisfies (fT4t we will also say that p satisfies the FKG condition. A subset A C D m 
is increasing if whenever x G vl and x < y then also y G A Similarly, ^4 is decreasing if x € A and y < x 
imply that y G A Finally, the measure of ^4 is //(^4) = X^eA The following well known fact is a 
special case of the FKG inequality. 

Proposition 6.4 Let \i : { — 1, l} m — ► K— &e fl log-supermodular probability measure on the discrete cube. 
If A and B are two increasing subsets of { — 1, l} m and C is a decreasing subset then 

Li(AnB)> fi(A)- li(B) and p,(A n G) < p{A) ■ p,{C). 

It is known that in order to prove that fj, satisfies the FKG lattice condition, it suffices to check this for 
"smallest boxes" in the lattice, i.e., for rj and 6 that agree at all but two locations. For completeness we prove 
this here. 

Lemma 6.5 Let n be a measure with full support. Then p, satisfies the FKG condition M4\ if and only if it 
satisfies M4\ for all r\ and 5 that agree at all but two locations. 

Proof: We will prove the non-trivial direction by induction on d = d{rj, 5), the Hamming distance between 
rj and 5. The cases where d(r), S) < 2 follow from the assumption. The proof will proceed by induction on 
d. Let d = d(rj, 6) > 3 and assume the claim holds for all smaller d. We can partition the set of coordinates 
into 3 subsets J = , I{ rj >&} and I{r)<s} > where rj and 5 agree, where rj > 5 and where rj < 5 respectively. 
Without loss of generality \I{ v >s} I > 2. Let i G I{ri>s} an d let rf be obtained from rj by setting rj ■ = 5i and 
letting rfj = rjj otherwise. Then since rf A 5 = rj A 5, 

Li(7 1 A5)rL(r l V5) = ( ii{rf Kb)p(rf V 5) \ ( p(rf)p(r) V S) \ 

The first factor is > 1 by the induction hypothesis since d(5, rj') = d(5, rf) — 1. Note that rf = rj A (j]' V 5) 
and rj V 5 = r] V (i]' V 5) and d(r]', i] V 5) = 1 + \I{ V< &} \ < d. Therefore by induction, the second term is 
also > 1. ■ 

The above tools together with symmetrization now allow us to prove Theorem 16.31 

Proof of Theorem l6.3t Recall that we have a tree T with m vertices, < p < 1, and a probability measure 
P on a G {—1, l} y ( T ) which is defined by 

P(a) = |(| +ip)^>(i-ip)*M 

where A(a) is the number of pairs of neighbors where a agrees and B(a) is the number of pairs of neigh- 
bors where a disagrees. To use Proposition 16.41 we need to show that P is a log-supermodular probability 
measure. 

Note that (TPfl) holds trivially if a < j3 or (3 < a. Thus it suffices to consider the case where there are 
two vertices u, v of T on which a and (3 disagree and that a v = f3 u = 1 and a u = (3 V = —1. If these 
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vertices are not neighbors then by definition of P we have that P(a)P(/3) = P(a V /3)P(a A (5). Similarly, 
if u is a neighbor of v in T, then one can easily check that 

P(a)PQ3) = /HV < 
P(a V /?)P(a A /3) \l + p) ~ 

Hence we conclude that measure P is log-supermodular. 

Let fi,...,fk be the functions used by the parties at nodes S = {v%, . . . , v^}- We will shift the functions 
in the sense of Kleitman's monotone "down-shifting" [29 1. Namely, define functions g\, . . . as follows: 

If fi(-l,x 2 ,...,x n ) = fi(l,x 2 , ■ ■ ■ ,x n ) then we set 

gi(-l,x 2 , ... ,x n ) = gi(l,x 2 , . . . ,x n ) = fi(-l,x 2 , ... ,x n ) = fi(l,x 2 , ■ ■ -,x n ). 

Otherwise, we set gi{— l,x 2 , ■ ■ ■ ,x n ) = —1 and gi(l,x 2 , . . . ,x n ) = 1. We claim that the agreement 
probability for the gi's is at least the agreement probability for the /j's. Repeating this argument for all bit 
locations will prove that there exists an optimal protocol for which all functions are monotone. 

To prove the claim we condition on the value of x 2 , • • • , x n at all the nodes vi and let ctj be the remaining 
bit at For simplicity we will denote the functions of this bit by /, and gi. Note that if there exist i and j 
such that 1) = /i(l) = —1 and fj(—l) = = 1, then the agreement probability for both / and g 
is 0. 

It therefore remains to consider the case where there exists a subset S' C S such that /«(— 1) = = 
1 for all i G S' and /i(-l) + fi(l) for alH G U = S \ S' (the case where = fi(l) = -1 for all 

i G S' can be treated similarly and the case where for all functions /»(— 1) / /i(l) may be decomposed into 
the above two events where S' = 0). Note that in this case the agreement probability for the g's is nothing 
but P(ct; = 1 : i G U) while the agreement probability for the /'s is P(«j = Tj : i G J7), where Tj = —1 if 
1) = 1 and Tj = 1 otherwise. 

Let V C [/ be the set of indices ? such that Tj = — 1 and let = {z G ?7 | = 1}. Let A be the set 
of strings in { — 1, l} m with on = 1 for all i G C/', let i? be the set of strings with on = 1 for alH G U 
and let C be the set of strings with on = — 1 for all i <E U'. Note that .A, i? are increasing sets and C is 
decreasing. Also, since our distribution is symmetric, it is easy to see that P(A) = P(C). Therefore, by the 
FKG inequality, the agreement probability for the g's, namely 

P{A HB)> P(A) • P(B) = P(C) • P(B) > P(C n B), 

is at least as large as for the /'s. ■ 

Remark: The last step in the proof above may be replaced by a more direct calculation showing that in 
fact we have strict inequality unless the sets U', U" are empty. This is similar to the monotonicity proof in 
I3T1 . This implies that every optimal protocol must consist of monotone functions (in general, it may be 
monotone increasing in some coordinates and monotone decreasing in the other coordinates). 

Remark: The above proof works in a much more general setup than just our tree-indexed Markov chain 
case. One can take any FKG measure on {—1, l}" 1 with all marginals having mean 0, take n independent 
copies of this and define everything analogously in this more general framework. The proof of Theorem l6.3l 
extends to this context. 

6.3 Monotonicity in the number of parties 

Our last theorem yields a certain monotonicity when comparing the simple dictator protocol T> and the 
simple protocol MAJ r , which is majority on the first r bits. The result is not very strong - it is interesting 
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mainly because it allows to compare protocols behavior for different number of parties. It shows that if 
MAJ r is a better protocol than dictatorship for k\ parties on the star, then it is also better than dictatorship 
for k 2 parties if k 2 > &i ■ 

Theorem 6.6 Fix p and n and suppose k\ and r are such that 

V(Stai kl , p, n, Star fcl , MAJ r ) > (>) P(Star fcl , p, n, Star fcl , V). 

Then for all k 2 > k\, 

■p(Starfc 2 ,p, n,Starfc 2 , MAJ, r ) > (>) T^Star^, p, n, Starfc 2 , V). 

Note that it suffices to prove the theorem assuming r = n. In order to prove the theorem, we first recall 
the notion of stochastic domination. If 77, 5 G {0,1,..., n} 1 , write 77 ^ S if rji < S{ for all i G /. If v and 
p are two probability measures on {0, 1, ... , n} 1 , we say p stochastically dominates v, written v < p, if 
there exists a probability measure m on {0, 1, ... , n} 1 x {0,1,..., n} 1 whose first and second marginals 
are respectively v and p and such that m is supported on {(77, 5) : 77 H 5}. 

Fix p, n > 3, and any tree T. Let our tree-indexed Markov chain be {x v } v <=t, where x v S {—1, l} n for 
each v G T. Let ^4 C {—1, l} n be the strings which have a majority of l's. Let X v denote the number of 
l's in x v . 

Given S CT, let ps be the conditional distribution of {X v } v& t given n v& s{x v G ^4} (= n v& s{X v > 
n/2}). The following lemma is key and might be of interest in itself. It can be used to prove (perhaps less 
natural) results analogous to Theorem l6.6l for general trees. Its proof will be given later. 

Lemma 6.7 In the above setup, if Si C S 2 C T, we have 

PS! di ps 2 - 

Before proving the lemma or showing how it implies Theorem 16. 61 a few remarks are in order. 

• Note that if {x^} is a Markov chain on {—1, 1}™ with transition matrix T p , then if we let X k be the 
number of l's in x k , then {X^} is also a Markov chain on the state space {0, 1, ... , n} (although it is 
certainly not true in general that a function of a Markov chain is a Markov chain.) In this way, with 
a slight abuse of notation, we can think of T p as a transition matrix for {X^} as well as for {xfc}. 
In particular, given a probability distribution p, on {0, 1, ... , n} we will write pT p for the probability 
measure on {0, 1, ... , n} given by one step of the Markov chain. 

• We next recall the easy fact that the Markov chain T p on {—1, l} n is attractive meaning that if v and 
p are probability measures on {—1, l} n with v ■< p, then it follows that vT p -< pT p . The same is true 
for the Markov chain {X^} on {0, 1, ... , n}. 

Along with these observations, Lemma l6~71 is enough to prove Theorem 16 .61 

Proof: Let vq, v%, . . . , Vk be the vertices of Star^, where vq is the center. Clearly, V(Staik, p, Star^, V) = 
(i + \p) k . On the other hand, a little thought reveals that 

fc-l 

P(Star fc , p, n, Star fc , MAJ n ) = J| (Li vo ,...,vt k t p){ a )-, 
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where u \ v means the distribution of v restricted to the location v (recall that AC {—1, l} n is the strings 
which have a majority of l's). By Lemma 16/71 and the attractivity of the process, the terms (p VOj ... tVi \ vo 
T p )(A) (which do not depend on k as long as I < k) are nondecreasing in I. Therefore if 

P(Star fc , p, n, Star fc , MAJ n ) > (>)(± + \p) k , 

then (/Au 0j . \ Vo T p ){A) > (>)\ + \p which implies in turn that for every k' > k, (pv ,...,v k ,_ 1 \v 
Tp){A) >{>)\ + \p and thus for all k' > k 

V(Stark>,P, n, Star fc /, MAJ n ) > (>)(± + \pf . 



Before proving Lemma l6~71 we recall the definition of positive associativity. If p is a probability measure 
on {0, 1, ... , n} 1 , p is said to be positively associated if any two functions on {0, 1, ... , n} 1 which are in- 
creasing in each coordinate are positively correlated. This is equivalent to the fact that if B C {0, 1, . . . , n} 1 
is an upset, then p conditioned on B is stochastically larger than p. 

Proof of Lemma RT71 It suffices to prove this when 52 is S\ plus an extra vertex z. We claim that for any 
set S, pLs is positively associated. Given this claim, we form ps 2 by first conditioning on H ve si{ x v G A}, 
giving us the measure p$i> an d then further conditioning on x z G A. By the claim, p,g 1 is positively 
associated and hence the last further conditioning on X z G A stochastically increases the measure, giving 

To prove the claim that ps is positively associated, we first claim that the distribution of {X v } v& t, 
which is just a probability measure on {0, 1, ... , n} T , satisfies the FKG lattice condition (flUi . 

Assuming the FKG condition holds for {X v } v& t, it is easy to see that the same inequality holds when 
we condition on the sublattice n v <=s{X v > n/2} (it is crucial here that the set n v& s{X v > n/2} is a 
sublattice meaning that r/, 6 being in this set implies that r] V S and -q A 5 are also in this set). 

The FKG theorem, which says that the FKG lattice condition (for any distributive lattice) implies positive 
association, can now be applied to this conditioned measure to conclude that the conditioned measure has 
positive association, as desired. 

Finally, by Lemma lOl in order to prove that P satisfies the FKG lattice condition, it is enough to check 
this for "smallest boxes" in the lattice, i.e., for -q and 5 that agree at all but two locations. If these two 
locations are not neighbors (i.e., two leaves), it is easy to check that we have equality. If they are neighbors, 
it easily comes down to checking that if a > b and c > d, then 

P(X 1 = c\X = a)P(X 1 = d\X = b)> P(Xi = d\X = a)P{X 1 = c\X = b) 

where {Xq, X\\ is the distribution of our Markov chain on {0, 1, ... , n} restricted to two consecutive times. 
It is straightforward to check that for p G (0,1), the above Markov chain can be embedded into a continuous 
time Markov chain on {0, 1, . . . , n} which only takes steps of size 1. Hence the last claim is a special case 
of Lemma l6"8l ■ 

Lemma 6.8 If {Xt} is a continuous time Markov chain on {0, 1, . . . , n} which only takes steps of size 1, 
then if a > b and c > d, it follows that 

P(Xi =c\X = a) P(Xi =d\X = b)> P(Xi =d\X = a) P(Xx =c\X = b). 

(Of course, by time scaling, X\ can be replaced by any time X t .) 
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Proof: Let R afi be the set of all possible realizations of our Markov chain during [0, 1] starting from a and 
ending in c. Define R a ,d, Rb,c and R^^ analogously. Letting P x denote the measure on paths starting from 
x, we need to show that 

Pa(Ra,c)Pb(Rb,d) > Pa(Ra,d)Pb(Rb,c) 

or equivalently that 

Pa X P b [R a ,c X Rb,d] >Pa>< P b [R a ,d X R b ,c] 

We do this by giving a measure preserving injection from R a ^ x Rb iC to R afi x Rb,d- We can ignore pairs of 
paths where there is a jump in both paths at the same time since these have P a x Pj, measure 0. Given a pair 
of paths in R a ^ x i?^ c , we can switch the paths after their first meeting time. It is clear that this gives an 
injection from R a ^ x Rb jC to i? a c x and the Markov property guarantees that this injection is measure 
preserving, completing the proof. ■ 

7 Conclusions and open questions 

In this paper we have exactly analyzed the NICD problem on the path and asymptotically analyzed the NICD 
problem on the star. However, we have seen that results on more complicated trees may be hard to come by. 
Many problems are still open. We list a few: 

• Is it true that for every tree NICD instance, there is an optimal protocol in which each player uses 
some majority rule? This question was already raised in l3T1l for the special case of the star. 

• Our analysis for the star is quite tight. However, one can ask for more. In particular, what is the best 
bound that can be obtained on 

A4(Star fc ,p, gfc) 

limn^oo "P(Starfc, p, n, Sk, MAJ ra ) 

n odd 

for fixed value of p. Our results show that = k°^. Is it true that lim^oo = 1? 

• Finally, we would like to find more applications of the reverse Bonami-Beckner inequality in computer 
science and combinatorics. 
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A Proof of the reverse Bonami-Beckner inequality 

Borell's proof of the reverse Bonami-Beckner inequality 1131 follows the same lines as the traditional proofs 
of the usual Bonami-Beckner inequality llT2l l8l. Namely, he proves the result in the case n = 1 (i.e., the 
"two-point inequality") and then shows that this can be tensored to produce the full theorem. The usual 
proof of the tensoring is easily modified by replacing Minkowski's inequality with the reverse Minkowski 
inequality [25 Theorem 24]. Hence, it is enough to consider functions / : { — 1, 1} — > R- (i.e., n = 1). 
By monotonicity of norms, it suffices to prove the inequality in the case thatp= (l-p) 1 / 2 /( 1 -^) 1/2 ; i - e - 
p 2 = (1 — p)/(l — q). Finally, it turns out that it suffices to consider the case where < q < p < 1 (see 
Lemma lA.3t . 

Lemma A.l Let f : {—1,1} — ► R-° be a nonnegative function, < q < p < 1, and p 2 = (1 — p)/(l — q). 
Then \\T p f\\ q > \\f\\ p . 
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Proof (Borell): If / is identically zero the lemma is trivial. Otherwise, using homogeneity we may assume 
that f(x) = 1 + ax for some a G [—1, 1]. We shall consider only the case a G (—1, 1); the result at the 
endpoints follows by continuity. Note that T p f(x) = 1 + pax. 
Using the Taylor series expansion for (1 + a) q around 1, we get 



\Tp!\\\ = ^(a+«p) g +(i-«p) 9 ) = ^f(i+f;(")«v)+(i+f;(^( 

\ n =l ^ ' n=l ^ ' 



-a)V) 



n=l v 7 

(Absolute convergence for \a\ < 1 lets us rearrange the series.) Since p > q, it holds for all a; > — 1 that 
(1 + x)^ 9 > 1 + px/q. In particular, from (fT3T > we obtain that 

IWK- (i+E( 2 l)«V") 5/, >i + Ef( 2 9 „) V" (.« 

\ n=l v 7 / n=l 



Similarly to (TT31 we can write 



? = i+£(/„> 2 " <>') 

n=l x 7 

From (Tl6l and (TlTt we see that in order to prove the theorem it suffices to show that for all n > 1 

Simplifying (fT8b we see the inequality 

(g - 1) • • • {q - 2n + l)p 2n > {p - 1) • • • (p - 2n + 1), 

which is equivalent in turn to 

(1 - g) • • • (2n - 1 - g)/> 2n < (1 - p) ■ ■ • (2n - 1 - p). (19) 

Note that we have (1 — p) = (1 — q)p 2 . Inequality (fT8l would follow if we could show that for all m > 2 it 
holds that p(m — q) < (m — p). Taking the square and recalling that p 2 = (1 — p)/(l — q) we obtain the 
inequality 

(1 — p){m — q) 2 < (m — p) 2 (l — q), 

which is equivalent to 

m — 2m + p + q — pq > 0. 
The last inequality holds for all m > 2 thus completing the proof. ■ 



We also prove the two-function version promised in Section ITT1 Recall first the reverse Holder inequal- 
ity l25l Theorem 13] for discrete measure spaces: 

Lemma A.2 Let f and g be nonnegative functions and suppose 1/p + 1/p 1 = 1, where p < 1 (p' = if 
p = 0). Then 

B[fg} = \\fg\\ 1 >\\f\\ p \\g\\ p/ , 

where equality holds if g = plv' , 
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Proof of Corollary 13.31 By definition, the left-hand side of © is ~Ei\fT p g\. We claim it suffices to prove 
© for p = (1 — p) 1 / 2 (l — q) l l 2 . Indeed, otherwise, let r satisfy p = (1 — p) 1//2 (l — r) 1 / 2 and note that 
r > q. Then, assuming © holds for p, r and p we obtain: 

E[/T p <7] > ll/llpll^lr > \\f\\ P \\g\\ q , 

as needed. 

We now assume p = (1 — p) 1//2 (l — g) 1 / 2 . Let p' satisfy 1/p + 1/p' = 1. Applying the reverse 
Holder inequality we get that E[fT p g] > \\f\\p\\T p g\\ p >. Note that, since 1/(1 - p') = 1 - p, the fact that 
p = (1 — p) l / 2 (l — q) 1 / 2 implies p = (1 — g) 1 / 2 (l — p') -1 / 2 . Therefore, using the reverse Bonami-Beckner 
inequality with p' < q < 1, we conclude that 

E[/(x)0(y)] > ll/llpll^Hp/ > ||/||p||^|| ff . 



Lemma A.3 It suffices to prove (Q/or < q < p < 1. 

Proof: Note first that the case p = 1 follows from the case p < 1 by continuity. Recall that 1— p = p 2 {\—q). 
Thus, p > q. Suppose (0 holds for < q < p < 1. Then by continuity we obtain (Q for < q < p < 1. 
From 1 — p = p 2 (l — q), it follows that 1 — </ = 1/(1 — q) = p 2 / (1 —p) = p 2 (l — p'). Therefore if p < 0, 
then p 1 = 1 — 1/(1 — p) > and g' = 1 — p 2 /(l — p) > p' > 0. We now conclude that if / is non-negative, 
then 

||r p /|| g = infills/Hi : ||<7|| g , = 1, 5 > 0} (by reverse Holder) 
= inf{||/T p g||i : \\g\\ q > = l,g > 0} (by reversibility) 

> infdl/llpll^llp' : \\g\\q> = hd > 0} (by reverse Holder) 

> ||/||pinf{y| g /:||^ = l,5>0} = ||/|| p (by Q for < p' < q' < 1). 

We have thus obtained that Q holds for p < 0. The remaining case is p > > q. Let r = and choose 
pi,P2 such that (1 — p) = p\{\ — r) and (1 — r) = p 2 (l — q). Note that < pi, P2 < 1 and that p = p\p2- 
The latter equality implies that T p = T pi T P2 (this is known as the "semi-group property"). Now 

\\T p f\\ q = \\T pi T p J\\ q >\\T p J\\ r >\\f\\ p , 

where the first inequality follows since q < r < and the second since p > r > 0. 

We have thus completed the proof. ■ 
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